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ABSTRACT 

We present an analysis of the substructure revealed by RR Lyraes in Sloan Digital Sky Survey 
(SDSS) Stripe 82, which covers 2°.5 in declination on the celestial equator over the right ascen- 
sion range a =20. 7 h to 3.3 h . We use the new public archive of light-motion curves in Stripe 
_C . 82, published by Bramich et al. in 2008, to identify a set of high-quality RR Lyrae candidates. 

Qh' Period estimates are determined to high accuracy using a string-length method. A subset of 

q i 178 RR Lyraes with spectrally derived metallicities are employed to derive metallicity-period- 

■ amplitude relations, which are then used, together with archive magnitude data and lightcurve 
' Fourier decomposition, to estimate metallicities and hence distances for the entire sample. 

d . The RR Lyraes lie 5-1 15 kpc from the Galactic center, with distance estimates accurate to ~ 8 

per cent. The RR Lyraes are further divided into subsets of 316 RRab types and 91 RRc types 
based on their period, colour and metallicity. 

We fit a smooth density law to the distribution as a simple representation of the data. 
qq ■ For Galactocentric radii 5-25 kpc the number density of RR Lyraes falls as r~ 2A , but beyond 

| 25 kpc, the number density falls much more steeply, as r~ 4 5 . However, we stress that in prac- 

■^J" ■ tice the density distribution is not smooth, but dominated by clumps and substructure. Sam- 

' pies of 55 and 237 RR Lyraes associated with the Sagittarius Stream and the Hercules-Aquila 

, Cloud respectively are identified. Hence, ~70 per cent of the RR Lyraes in Stripe 82 belong 

■ to known substructure, and the sharp break in the density law reflects the fact that the dom- 
| inant substructure in Stripe 82 - the Hercules-Aquila Cloud and the Sagittarius Stream - lie 

within 40 kpc. In fact, almost 60 per cent of all the RR Lyraes in Stripe 82 are associated with 
the Hercules-Aquila Cloud alone, which emphasises the Cloud's pre-eminence. Additionally, 
evidence of a new and distant substructure - the Pisces Overdensity - is found, consisting 
of 28 faint RR Lyraes centered on Galactic coordinates {( « 80°, b ~ -55°), with distances 
*_| ' of ~80kpc. The total stellar mass in the Pisces Overdensity is ~10 4 M and its metallicity is 

C3 ; [Fe/H] ~ -1.5. 

Key words: catalogues - stars: photometry, astrometry, variables - Galaxy: stellar content - 
galaxies: photometry 
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1 INTRODUCTION 

In Aristotle's model of the Universe, the stars were fixed on 
a rotating sphere and eternally invariable. Stellar variability has 
been known since Fabricius' discovery of Miras in 1596, although 
the ancient Chinese and Korean astr onomers were already famil - 
iar with supernovae or "guest stars" ddark & Stephenson II 1979h . 
Proper motions were discovered in 1718 by Edmund Halley, who 
noticed that Sirius, Arcturus and Aldebaran had moved from their 
fixed positions recorded in Aristotelian cosmology. 



Despite this long history, our knowledge of both variable 
stars and high pro per motion sources remains very incomplete. As 
IPaczvnskil ( 2000) has emphasised, over 90 per cent of variable 
stars brighter than 12 mag have not been discovered. There are 
still comparatively few large archives of variable sources and, as 
a consequence, our knowledge of many classes of object, includ- 
ing novae, supernovae, RR Lyraes (the focus of this paper) and 
high proper motion objects remains limited. Indeed, the variable 
sky remains one of the most unexplored areas in astronomy, with 
the exciting possibility that even bright variable objects may corre- 



© 2004 RAS 



2 L.L. Watkins et al. 




3.0 3.5 4.0 4.5 5.0 5.5 6.0 0.1 1.0 10.0 100.0 

mean object type reduced x* 

Figure 1. Left: Cumulative distribution of mean object type in the HLC. Galaxy-like objects are assigned an object type of 3 and star-like objects are an object 
type of 6. For multiple observations, the mean indicates whether the object is mostly classified as a star or a galaxy. The cut used to define the stellar sample 
is shown as a dashed vertical line. Right: Cumulative distribution of reduced x~ for g (green, solid) and r (red, dashed) bands for all stellar objects. The slope 
of the distribution turns over at a reduced^ 2 value of ~ 3, which is taken as the boundary separating constant and variable stars; this value is marked on the 
graph as a dash-dot line. 
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Figure 2. Density plot of reduced x\ against Stetson index L g for the stel- 
lar catalogue. Pixel colour represents the number of objects in each pixel 
bin. The cuts used to extract the high quality sample of stellar variables are 
shown as white lines. 



spond to comple tely unknown astronomical phenomena (see e.g., 
|Paczvriskill200ll) . 

The modern era of massive varia bility searches begi ns with 
the microlensing surv eys like MAC HO (lAlcocketalJl993r) . EROS 
dAubourg et ai]ll993h and OGLE dUdalski et alj|l992h . Typically, 
these surveys monitored millions of stars down to V~2\ a few 
times every night over several years in the directions of the Galac- 
tic Bulge and the Magellanic Clouds. The resulting huge databases 
of lightcurves yielded information on many rare types of astro- 
physical variability. They were the first projects that harnessed the 
power of large format CCD cameras and modern computers to 
show that the acquisition, processing and archiving of millions of 
photometric measurements was feasible. The surveys were soon 
followe d by high-redshift s upernova surveys (such as High-Z SN 
Search ( Schmidt et al. 1998) and the Supernova Cosmology project 
jPerlmutter et al . 1999)), which typically had a lower time resolu- 
tion and smaller area coverage than the microlensing surveys, al- 
though much deeper limiting magnitudes. 

The Sloan Digital Sky Survey (SDSS: lYork et alj feOQO) pro- 
vides deep and homogeneous photometry in five bands in a large 



area around the North Galactic Cap, but with almost no variability 
information. The main exception is the compilation of repeat scans 
of the ~290 square degree are a - known as Stripe 82 (see e.g., 
lAdelman-McCarthv et alB oOS). The dataset has allowed the dis- 
covery of many new supernovae, which are publicise d and followed 
up spectroscopically wit h other telescopes (see e.g.. lFrieman et all 
l2008l ; lD"ildav et al]|2008h . By averaging repeat ob servations of un- 
resolved sources in Stripe 82, llvezic et all |2007) built a catalogue 
of 1 million stand ard stars with r magnitudes between 14 and 22. 
ISesar a t al. (2007|) then carried out the first analysis of ~\A mil- 
lion variable stars and quasars using a colour-colour plot to assign 
variable types. 

Recently, iBramich et al] (2008) presented a catalogue of al- 
most four million "light-motion curves" using the data available 
in Stripe 82. Objects are matched between the ~30 epochs, taking 
into account the effects of any proper motion over the eight-year 
baseline stretching back to the earliest runs in 1998. Thanks to the 
high quality of the SDSS imagin g, excellent astrom etric and photo- 
metric precision is attainable. IBramich et al.l J2008I) also provide a 
Higher-Level Catalogue which is a set of 229 derived-quantities for 
each light-motion-curve. These quantities describe the mean mag- 
nitudes, photometric variability and astrometric motion of a sub- 
set of objects whose light-motion-curve entries pass certain quality 
constraints. 

Variable stars of particular interest are RR Lyraes, which have 
often been used to identify Galacti c substructure - for example, 
in studies of the Sagittari us Stream Jlvezic et alfe OQO; Viv as et all 
|2004 iKeller et all |2008|), and the Monoceros and Virgo O ver- 
densities iVivas & Zinnll200rjj ; lKeller. Da Costa & Prioill 20091) . RR 
Lyraes are particularly useful for three reasons. First, they are con- 
stituents of the old, metal-poor halo, in which substructure is abun- 
dant. Second, they are standard candles, enabling an estimate of 
their distances to be made. Third, they are bright enough to be de- 
tected out to distances of ~130kpc in the SDSS data, giving us an 
insight into the structure of the remote Milky Way halo. 

In this paper, we select first the variabl e objects in St r ipe 82 
and then the subset of RR Lyraes, using the IBramich et alj (2008) 
Light-Motion Curve Catalogue (LMCC) and Higher-Level Cata- 
logue (HLC). ^2]discusses the selection of the variable objects and 
their properties, whilst !}3] describes the identification of the RR 
Lyrae stars and properties of the population in some detail. ^4]dis- 
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Figure 3. A sample of folded lightcurves. The period in days is recorded at the top of each plot. The solid line is the mean magnitude, whilst the dashed lines 
represent lcr deviations. In the bottom right corner, the number refers to the region in the colour-colour plot in which the lightcurve lies (see Figure lATI in 
AppendixlAl. The two rightmost lightcurves in the middle row are probable RR Lyraes. 



cusses substructure seen in the distribution of the RR Lyraes and 
we conclude in 5j5] 



2 VARIABLE STARS IN STRIPE 82 
2.1 Variable Selection 

The HLC contains 3,700,548 objects. Every object in the survey is 
assigned an object type each time it is observed: 3 if it is ga laxy- 
like and 6 if it is star-like (see § 3.1 of iBramich et al . 2008). For 
multi-epoch data, the object type averaged over all the observations 
provides a relatively reliable indicator of whether the source is star- 
like or galaxy-like. The cumulative distribution of mean object type 
in the left panel of Figure [TJ shows that ~ 55 per cent of objects in 
the catalogue are purely star-like or purely galaxy-like. To extract a 
sample of stars with essentially zero contamination from galaxies, 
we require that the mean object type is 5.5 or greater. This results 
in a "stellar" sample of 1 671 582 objects. 

For the stellar sample, a cumulative distribution of reduced^ 2 



(that is, x~ per degree of freedom) for the g and r bands is shown 
in the right panel of Figure[TJ The value of reduced x 2 at which the 
distributions turn over is ~3, which is taken as the ^ 2 -value below 
which stars are assumed to be well-modelled by a constant baseline. 
The number of objects that simultaneously satisfy reduced x 2 > 3 
in both g and r is 41 729. 

The stars with reduced^- 2 > 3 in both g and r bands are mainly 
variables, but still contain some artifacts, typically due to one or 
two outlying photometric measurements. One way to test for true 
variability is to look for correlations between different bands; a true 
variable star will usually have changes in brightness that are corre- 
lated in all bands whereas a discordant measurement may exist in 
one band only. Throughout this analysis, the Stetson index (L s ) is 
used as a measure for correlated variability between the g and r 
band data (see Stetson 1996). 

For the stellar sample, a density plot of reduced x 2 in the g 
band against the Stetson L g is shown in Figure[2] in which two dis- 
tinct populations can be discerned. The first has an almost linear 
correlation between reduced^ 2 and L g , and are almost all true vari- 
able stars. The second has a high reduced^ 2 , but L g is low, indicat- 
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Figure 4. Left: Colour-magnitude diagram (g versus u - g) for all 527 62 1 sources in the stellar sample for which high-quality u and g data exist. Right: 
Colour-magnitude diagram (g versus u — g) for the subset of 21 939 variables. 



ing that the brightness changes which give rise to the high reduced 
X 1 values are not correlated between bands. To extract a sample of 
high-quality variable stars, we impose the simultaneous restrictions 
L g > 1, reduced^ > 3 and r educed yj; > 3, togeth er with requiring 
at least 10 good epochs (see Bramich et al. 2008), leaving 21 939 
objects. We present a comparison o f the content of our variable cat- 
alogue with the earlier catalogue o f lSesaratalJj2007t) in Appendix 

El 

To show the quality of the data, a selection of folded light- 
motion curves are shown in Figure[3] from which a variety of peri- 
odic phenomena such as stellar variability and eclipses are evident. 
In particular, the two right-most images in the middle row are very 
likely RR Lyraes. From the periods and the lightcurve shapes, we 
might surmise that the first lightcurve is an ab-type RR Lyrae and 
the second is a c-type RR Lyrae. 

2.2 Variable Properties 

Colour-magnitude diagrams (g versus u — g) are plotted for both 
the stellar sample with u band data and the subset of variable stars, 
in Figure [4] For the stellar sample, we see three prominent clumps 
associated with the thin disk, thick disk and halo on moving red- 
wards in colour. For the variables, there is one prominent clump 
centered on u — g » 0.2, which is primarily associated with vari- 
able quasars. There are less prominent, but still discernible, peaks 
associated with variable stars in the thin disk, thick disk and halo. 

A crude discrimination between different classes of variable 
objects is possible in g - r versus u — g space. We find that the 
variable sample is largely comprised of stellar locus stars and 
low-redshift quasars; stellar locus stars are predominant at bright 
(g < 19) magnitudes while low-redshift quasars dominate at faint 
(g < 22) magnitudes. RR Lyraes also make a significant contribu- 
tion (see AppendixfA]for more detail on the statistical properties of 
the variable sample). 
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Figure 5. Top: Reduced proper motion diagram for all objects in the HLC 
with proper motion of S/N > 10 and \fi\ > 2 masyr - '. Bottom: Reduced 
proper motion diagram for all objects in the variable subset with S/N > 5 
and \fj\ > 2 masyr -1 . 



2.3 The Proper Motions 

The HLC offers an improvement over previous variability work 
in Stripe 82 through the availability of proper motions. The com- 
bination of stellar photometry and proper motions has proved to 
be a powerful way of classifying stars - in particular, members of 
elusive populations such as white dwarfs, brown dwarfs and wide 



binaries. Such combined catalogue s, drawn from the i ntersections 
of SDSS data with USNO-B data jMonet et al. Il2003h. have been 
constructed befo re dMunn et all 120041 : iGould & Kollmeier 1120041 : 
iKilic et al]|2006t) . Compared to such catalogues, the HLC is re- 
stricted to Stripe 82 and the proper motion sensitivity is poorer, due 
to the much shorter time baseline. On the other hand, the Stripe 
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82 photometric catalogue extends approximately 1.5 magnitudes 
deeper than the limiting magnitude of USNO-B proper motions 
(V~21). 

The reduced proper motion is defined as H = r + 5 iog/i + 5, 
where r is the apparent magnitude and yu the proper motion in arc- 
sec yr~' . The criteria for inclusion in the reduced proper motion di- 
agram in the top panel of Figure|5]is that the object lies in our stellar 
sample, that the proper motion is measured with a signal-to-noise 
ratio (S/N) >10 and that the absolute value of the proper motion 
|yu| exceeds 2masyr~'. The S/N-cut has been chosen primarily to 
ensure easy visibility of structure on the figure, whilst the proper 
motion cut enables us to excise quasars. We discern three distinct 
sequences of stars, namely Population I disk dwarfs, Population 
II mai n sequence subdwarfs and disk white dwarfs. Ividrih et al.l 
d2007t) have used this reduced proper motion diagram to iden- 
tify new ult r a-cool and halo white dwarfs in Stripe 82, Similarly, 
Smit h et all ( l2009allbh have used the same procedure to extract a 
sample of halo subdwarfs in studies of the velocity ellipsoid and 
halo substructure. The lower panel of Figure [5] shows the reduced 
proper motion diagram just for the subset of variables with proper 
motion S/N>5 and |yu|>2mas yr -1 . The variables are disproportion- 
ately drawn from the Population I disk dwarfs, although the other 
two sequences can still be seen. 
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Figure 6. Lomb-Scargle period estimates in the g and r bands for the candi- 
date RR Lyraes. The coloured lines represent resonance lines along which 
one period is a harmonic of the other: P r = P g (red), P, = P g /(l ± P g ) 
(orange), P r = (P g /1 ± 2P g ) (green), P r = P g /{P g - 1) (cyan), P r = 
Pgl(2P g - 1) (blue). 



3 RR LYRAES 

3.1 Identification of RR Lyraes 

Here, we develop the tools to extract a high quality sample of RR 
Lyraes from the variable catalogue. We begin by selecting the 873 
candidates that simulta neously satisfy all the following criteria, 
which are adapted from llvezic et alj (|2005), namely 

13.5<r< 20.7, 0.98 <u- g < 1.35, 

-0.16 <r-i< 0.22, -0.21 <i- z < 0.25, (1) 

D%" < D ug < 0.35, O™" < D gr < 0.55, 

where 

C„ s = (u-g) + 0.67(g-r)-1.07, D gr = 0A5(u-gHg-r)-0. 12. (2) 

The r band magnitudes correspond to distances ~5-130kpc. D llg 
and Dp represent slopes in the u-g and g-r colour-colour plane. 
Combined with the cut on u - g, the D„ g and D gl criteria constrain 
the RR Lyraes to a hexagonal box in colour-colour space, optimiz- 
ing the selection of RR Lyraes. The values D'" g " and D" gr " can be 
altered to adjust the completeness and efficiency of the RR Lyrae 
selection. We chose to use values D"™ = -0.05 and £>™" = 0.06, 
which would give a completeness of 100 per cent for the QUEST 
survey RR Lyraes dVivas et alj|2004llvezic et alj2005h . 

3.2 RR Lyrae Periods 

Determining periods for our RR Lyrae candidates is non-trivial. In 
general, there are 30 to 40 datapoints in a lightcurve, unevenly sam- 
pled over an eight-year baseline. From this sparsely-sampled data, 
we seek a period that is a fraction of a day. The multi-band nature 
of the SDSS survey is an advantage here, as we are able to verify 
that any period estimate we obtain in one band is consistent with 
the data in additional bands. 

The LMCC contains all the data for a given lightcurve. 
Each datapoint has a flag that is set (or unset) if the datapoint 
passes (or fails) certain quality requirements (for more details, see 
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Figure 7. A sample periodogram spectrum. The highest peak in each spec- 
tram is indicated by the blue line; the additional peaks selected by the 
method described in the text are indicated by the orange lines, with the 
eventual best-fit peak marked with a red line. At least one peak was found 
in each region delineated by the cyan lines. 

Brami ch et al . 2008). For period estimation, it is important that 
we use only the reliable data to minimise errors. In general, the 
g and r bands have smaller errors and fewer outliers and their clean 
lightcurves are the best sampled, so these bands are used together 
to estimate periods for our lightcurves. 

As a first pa ss, we run a Lomb-Scargle periodogram (see e.g., 
iPress et alj[l98g) on each of the g and r band lightcurves, taking 
care to ensure that the frequency range extends to the high frequen- 
cies (or low periods) expected for RR Lyrae stars and that the sam- 
pling rate is detailed enough to discern individual peaks. The re- 
sulting period estimates are plotted in Figure [6] First, we note that 
there are a number of candidates which have matching period esti- 
mates. A number of resonance lines - the locations at which one of 
the period estimates is a harmonic of the other - are plotted as solid 
lines in the graph. 
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Figure 8. The distribution of periods for all RR Lyrae candidates with peri- 
ods in the expected range for RR Lyraes (black, solid), for the ab-type RR 
Lyraes (red, dashed) and the c-type RR Lyraes (blue, dot-dashed). The spike 
at P < 0. 1 is due in part to S Scuti and SX Phe stars, and the population with 
the shortest periods is due mostly to eclipsing variables. 



The resonance lines are well-populated, indicating that the 
Lomb-Scargle periodogram can return harmonics of the true period 
as well as the period itself. Hence, we must consider whether the 
exact period matches are indeed cases where the true period has 
been recovered, or whether both period estimates are harmonics, 
and so on. Lightcurves with periods that do not match or do not lie 
on resonance lines could be objects that are not periodically vari- 
able - quasars, for example - or objects for which the periodogram 
has failed to recover the true period or a harmonic of the period 
in one or both cases. Not all of the period estimates lie within the 
range expected for RR Lyraes (0.2 - 0.8 days), which is probably a 
consequence of contamination in our sample. However, it is unclear 
whether we can simply reject any objects with period estimates 
larger than those expected for RR Lyraes. Certainly, some of the 
larger period estimates could be attributed to the sparse sampling 
of the lightcurves, which may generate a signal that overwhelms 
the periodic nature of the RR Lyraes. 

Accordingly, we experimented with a number of alternative 
methods, including binning, smoothing and phase dispersion min- 
imization, before concluding that a string-length technique is the 
most effective for our dataset. In outline, a string-length method 
works by phasing a lightcurve with a trial period and calculating 
the sum of the straight-line distances between consecutive points. 
The sum of these distances, or the string-length, will be minimised 
if the trial period is the true period 

We use a variation of the lLafler & Kinmanl dl965h string- 
length technique described in IStetsonl i ll996ir For each trial period, 
the string-length is computed for the g and r bands. Their sum is 
taken as the overall string-length, which is minimised to obtain a 
period estimate. Running a string-length period finder over a wide 
and finely-sampled range of trial periods is computationally ex- 
pensive. However, we can short-cut the process by restricting the 
string-length period search to a narrow range of periods, centred on 
a set of the most likely periods identified via the Lombe-Scargle 
periodogram. 

To ensure that the correct RR Lyrae periods are identified, the 
g and r band Lombe-Scargle periodogram spectra were combined 
and the highest peaks in each of the ranges P<0.2, 0.2<P<0.6, 



Figure 9. Period versus the combined g and r band amplitudes. Three se- 
lection boxes are shown: red for candidate RRabs, blue for candidate RRcs, 
and green for candidates warranting further study. 



0.6<P<1.0 and P> 1.0 days were selected. Then, four further peaks 
were selected from each of the g and r band spectra independently. 
This was done according to highest amplitude, until four distinct 
new peaks had been selected in each band. Each peak was then con- 
sidered in turn: the string-length was calculated for a narrow range 
of periods spanning the peak, the period for which the string-length 
was minimised was taken to be the best period in the vicinity of 
that peak. Finally, the period that returned the shortest string-length 
overall was adopted as the period estimate for the lightcurve. 

In Figure [8] the solid black line is the period distribution for 
all of the RR Lyrae candidates and shows four clear peaks. Mov- 
ing from high to low periods, these populations are predominantly: 
ab-type RR Lyraes (peak at ~0.6d), c-type RR Lyraes (peak at 
~0.35d), eclipses (peak at ~0. 1 8 d) and 5 Scuti and SX Phe stars 
(peak at ~0.05 d). Also present in this candidate sample, are a num- 
ber of non-periodic variables. The red dashed line is the period 
distribution for only those stars later determined to be ab-type RR 
Lyraes and the blue dot-dashed line is for those stars we later de- 
termine to be c-type RR Lyraes. 

Before proceeding any further, we clean the sample of some 
eclipsing variable stars, S Scuti stars, SX Phe stars and non-periodic 
variable contaminants by adopting a stringent cut on Stetson index 
L g >2.5. To perform subsequent analysis, we require that objects 
have a sufficient number of clean data points; that is, we impose 
further cuts on the number of clean epochs in the g and r bands: 
N g >5 and N r >5, leaving 604 candidates. 

Uncertainties in the period estimates may be attributed to two 
sources. If we assume that the estimate is indeed close to the true 
period and not a harmonic, then any error is due to the string-length 
fitting technique, estimated to be ~10~ 5 day, from analysis of the 
phased RR Lyrae lightcurves. However, for a small fraction of the 
sample, where the period estimate is a harmonic of the true period, 
the error will be ~0. 1-0.5 day. 



3.3 RR Lyrae Classification 

iBailev & Pickering! J 19021) first divided RR Lyraes into three 
classes - a, b and c - based on the appearance of their lightcurves. 
Further study of Lyraes has altered the classification instead to just 
ab and c. It is believed that RRab stars are pulsating in the funda- 
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in eqns {7) and (9) is shown. The left (right) panel plots the RRab (RRc) 
candidates with spectroscopic metallicities against the corresponding pho- 
tometric estimate. 



mental mode and RRc stars are pulsating in their first overtone (e.g., 
ISmith II 1995b . 

The two classes have somewhat different properties: RRab 
stars generally have relatively large amplitudes (of order a mag- 
nitude) and asymmetric lightcurves with a steep rising branch and 
a slow, steady decline. Their periods lie mostly in the range 0.4- 
1.0 day and studies of RR Lyraes in globular clusters have shown 
that the amplitude of RRab stars decreases with increasing period. 
RRc stars have smaller visual amplitudes (around half a magnitude) 
and their lightcurves are more symmetrical and nearly sinusoidal. 
They generally have periods of 0.24-0.5 day. 

To separate the two classes, we proceed by plotting the can- 
didates in the plane of (P,A), where P is the period and A is the 
sum of the amplitudes in the g and r band lightcurves, as shown in 
Figure|9] The red box is defined as the region 

0.43 < P < 0.85, 0.23 < A < 1.34 

2.3 <A + 3.3/><3.3 (3) 

and includes 296 RRab candidates. The blue box is defined as the 
region 

0.21 < P < 0.47, 0.25 < A < 0.65 (4) 

and includes 122 RRc candidates. 

Finally, the green boxes (not fully shown in Figure[9]for clar- 
ity, but which extend to a period of two days) include a number of 
further candidates, which we do not want to discard without further 
investigation. Objects with a period of one day are almost certainly 
spurious (the lightcurves have a sampling period of one day) and 
are removed. 

For the 43 objects that remain, there is the possibility that they 
may lie away from the concentration of RR Lyraes because of an 
incorrect period estimate. As discussed in Section l3~2l errors in the 
period estimates are small, but sometimes a harmonic of the true 
period is obtained. Hence, we revisit the period analysis to deter- 
mine whether any of the likely period peaks lie within the red RRab 
box or the blue RRc box. If more than one period peak lies within 
a box, the period with the minimum string-length is used. Any ob- 
jects for which a fitting period can be found are added to the RRab 
or the RRc candidate set as appropriate. This reclassification results 
in 330 RRab and 137 RRc candidates. 

3.4 The RRab types 

126 of our RRab and 52 RRc candidates possess SDSS spectra 
and have spectroscopic metallicity estimates. We use these objects 



Figure 11. The RR Lyrae candidate selection boxes in the u—g versus g—r 
plane; the orange (dot-dashed) box is used to select RRab types, the cyan 
(dashed) box for RRc types. The RRab (RRc) candidates that pass the pe- 
riod, amplitude and metallicity cuts are shown as red squares (blue circles). 
Also shown are suspected eclipsing variables as green triangles. 

to calibrate empirical relationships and thence derive photometric 
metall icities for the ent i re sam ple . 

Ijurcsik & Kovacsl l fl996h found that the metallicity of an RR 
Lyrae depends on the period P and the shape of the lightcurve, 
which may be parameterised via a Fourier decomposition: 

N 

f(ff)=A +Y J A i sin(i8 + (f> i ). (5) 
i=i 

The amplitudes A, and the phases 0, can then be combined as fol- 
lows: 

A; 

Ay = — , frj = ]<t>i - l<f>j. (6) 

Inspired by the analogous relation of Ijurcsik & Kovacsl dl996h . we 
use the spectroscopic metallicities and the lightcurve properties to 
derive the metallicity-period- amplitude-phase relation 

[Fe/H] = 0.845 - 4.487/> - O.1870 31 - 1.454A 31 + 2.166P0 31 
+1.563PA 3 i - 8.374P 2 - 0.081^,. (7) 

which has a typical scatter a = 0.25. Its performance is shown in 
the left panel of Fig. [10] To refine our RRab sample, we insist that 

- 3 < [Fe/H] < 0, (8) 

and apply a further restriction to trim the sample of a remaining 
few eclipsing variables by imposing the selection box in orange 
(dot-dashed lines) in colour-colour space shown in the lower panel 
of FigureQT] This leaves us with 325 RRab candidates. Also shown 
in this figure are the confirmed RRab types as red squares and likely 
eclipsing variables as green triangles. 

3.5 The RRc types 

Morg an etall d2007l) found that the metallicities of c-type RR 
Lyraes also vary with period and lightcurve shape. Owing to the 
difference in pulsation mode, and hence lightcurve shape, between 
the RRab stars and the RRc stars, a new metallicity relation must 
be derived for the RRc candidates. The lightcurves were Fourier 
decomposed and the 52 RRcs with spectroscopic metallicities were 
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property 


f&ab 






o~ c 


A g (mag) 


0.47 


0.14 


0.26 


0.05 


A r (mag) 


0.34 


0.11 


0.19 


0.03 


P (days) 


0.58 


0.07 


0.33 


0.04 


4 


363.8 


326.1 


146.6 


80.0 


XT 


250.9 


232.3 


98.5 


53.8 


h 


10.7 


5.2 


8.4 


2.5 


[Fe/H] 


-1.38 


0.16 


-1.61 


0.42 


D (kpc) 


28.9 


21.9 


21.1 


17.6 


r (kpc) 


29.2 


21.7 


21.5 


17.5 


M z 


0.44 


0.11 


0.71 


0.15 


N g 


29.6 


7.4 


29.4 


8.4 


N r 


29.6 


7.7 


29.6 


8.0 



Table 1. The means and dispersions in the properties of the RR Lyrae vari- 
ables split according to RRab and RRc types. Listed are the amplitudes in 
the g and r bands, the period, the reduced x 1 in g and r, the Stetson index, 
the metallicity, the heliocentric distance D and Galactocentric distance r, 
the absolute magnitude M. and the number of good epochs in g and r. 



used to derive the following metal licity-period-amplitu de relation, 
the form of which is suggested by iMorgan et al] (2007): 



[Fe/H] = -10.669 + 60.944P + 4.351A 21 

-23.418PA 21 - 95.344P 2 + 1.864A 2 , 



(9) 



As the right panel of Fig. [10] shows, there is a somewhat greater 
scatter of a = 0.38 in this relationship than the corresponding one 
for RRabs. 

A metallicity cut -3.75 < [Fe/H] < removes a few obvious 
outliers. As for the RRabs, we discard eclipsing variables by impos- 
ing the selection box in cyan (dashed lines) in colour-colour space 
shown in the lower panel of Figure [TT1 This leaves us with 97 RRc 
type candidates. Also shown in this figure are the confirmed RRc 
types as blue dots and likely eclipsing variables as green triangles. 

In the same way that the outlying candidates were reanalysed 
to determine whether a period could be found that placed the star 
into the RRab or RRc dataclouds, so the RRab and RRc rejects 
are also reanalysed, just in case they are misclassified RRc and 
RRab respectively. The metallicity and colour cuts described above 
then determine inclusion in the candidate sets. Finally, a visual in- 
spection of the lightcurves confirms that these are very high-quality 
samples. Judging by lightcurve shape, there are at most 21 possible 
contaminants in the RRab set and only 6 possible contaminants in 
the RRc set. The lightcurves typically possess low S/N and/or are 
poorly sampled. While many of these objects will be RR Lyrae, to 
be conservative, they are removed, giving us a final sample of 316 
RRab stars and 91 RRc stars, 407 RR Lyraes in all. 



4 SUBSTRUCTURE REVEALED BY THE RR LYRAES 
4.1 RR Lyrae Distances 

RR Lyraes are "standard candles" because they have a well-defined 
absolute magnitude, which, nonetheless, depends on metallicity. 
We calculate distances via the distance modulus: 



lOgD: 



m- - M- + 5 



(10) 



The apparent magnitudes in- come directly from the HLC. The ab- 
solute magnitudes M- are obtained via the following relation from 



ICaceres & CatelarJ J2008I) : 

M z = 1.3706 + 0.8941 logZ + 0.1315[logZ] 2 

- (2.6907 + 0.81921ogZ + 0.0664[logZ] 2 )lnC (11) 
+ (47.9836 + 31.7879 logZ + 5.2221 [log Z] 2 )[lnC ] 2 
+ (141.7704 + 100.6676 log Z + 17.4277[logZ] 2 )[lnC ] 3 
+ (0.3286 + 2.0377 logZ + 0.3882[logZ] 2 )log/> 

where P is the fundamental period, logZ = [Fe/H] - 1.5515 and 
Co = (n-g)o - (g— r)o, with the subscript denoting that the colours 
are unreddened. The intrinsic scatter in this relation is small com- 
pared to the errors. Uncertainties on the metallicities [Fe/H] and 
distances D are computed using standard methods. From this, we 
find that the distance errors are typically around 8 per cent (this 
includes the scatter due to the metallicity relationships). 

The right ascension, declination, classification, mean magni- 
tudes, amplitude, period and distance of all our 407 RR Lyrae can- 
didates are given in an accompanying electronic table. The means 
and dispersions for some useful quantities for the RRab and RRc 
subsamples are given in Table[T] 

4.2 The Sagittarius Stream, the Hercules-Aquila Cloud and 
the Pisces Overdensity 

The distribution of RR Lyraes in right ascension and distance is 
shown in Figure[l2] with the ab-types plotted as red circles and the 
c-types plotted as blue triangles. There are a number of things to 
notice. First, there are 296 RR Lyraes at right ascensions 20.7 n < 
a < 24 n , but only 1 1 1 at n < a < 3.3 n . The greatest concentration 
of RR Lyraes is in the fields c oincident with the Hercules-Aquila 
Cloud dBelokurov etail2007l) . Of course, not all these RR Lyraes 
are necessarily associated with the Cloud, as there may be contam- 
ination from the underlying smooth population associated with the 
Galactic Spheroid. It is known that the Bulge and Spheroid har- 
bour a population of RR Lyraes, distributed in a roughly spheri- 
cal manner around the Galactic Centre , with a metallicity distri- 
bution peaked at [Fe/H] ~ -1 (see e.g. IWalker & Terndrud[l99ll: 
lAlcock et al]|l998l : ICollinge et al]|2006t) . The plane of the orbit of 
the Sagittarius dwarf galaxy crosses Stripe 82, and there is a visi- 
ble overdensity of RR Lyraes at this location (a « 2''). Finally, we 
note that there are few (specifically 47) RR Lyrae at large distances 
(D> 50 kpc), of which 28 lie in a clump at a a 23.5 n .We term the 
structure the Pisces Overdensity. Distance uncertainties are shown 
as vertical bars for each RR Lyrae - although the error bars for 
the distant RR Lyrae are large enough to be visible, they cannot be 
responsible for the overdensity. 

The upper panel of Figure Q3] shows the fraction of accessi- 
ble volume as a function of Galactocentric radius r probed by our 
survey. The volumes are calculated via Monte Carlo integrations in 
which the RR Lyrae luminosity function is modelled as a Gaussian 
with mean and dispersion from Table [TJ the magnitude limits used 
were those defining our RR Lyrae sample. The survey reaches at 
least r~ 100 kpc before the accessible volume begins to decline. The 
brightest RR Lyraes in our sample have M-=0.1 and so are still de- 
tectable within r~130kpc. In classical models of the smooth halo, 
the RR Lyraes are distributed as a power- law like p ~ r~" with 
n~3. 1 (Wetterer & McGraw 1996). With no substructure present in 
Stripe 82, the right ascension-distance graph of Figure [T2] would 
look rather different. The fall-off in numbers would be steady, and 
not as sharp as the drop observed beyond D~40 kpc, which is real 
and cannot be attributed to properties of the survey. 

The presence of an edge to the RR Lyrae distribution in the 
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Substructure 


6 (deg) 


b (deg) 


D (kpc) 


r (kpc) 


P (days) 


[Fe/H] 


Hercules-Aquila Cloud 
Sagittarius Stream 
Pisces Overdensity 


[45°,79°] 
[139°, 182°] 
[63°,93°] 


[-56°, -24°] 
[-62°,-46°] 
[-60°,-46°] 


21.9±12.1 
26.1+ 5.6 
79.9±13.9 


20.1 + 11.3 
31.4± 5.6 
79.4±14.1 


0.51±0.12 
0.54±0.12 
0.56±0.08 


-1.43±0.36 
-1.43±0.30 
-1.48+0.28 



Table 2. The range in Galactic coordinates and the means and dispersions in heliocentric distance, Galactocentric distance, period and metallicity of the RR 
Lyrae variables in the Hercules-Aquila Cloud, the Sagittarius Stream and the Pisces Overdensity. 
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Figure 12. The spatial distribution of the ab-type (red circles) and the c-type 
(blue triangles) RR Lyraes. 
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Figure 13. Top: The fraction of Galactic volume probed by the RR Lyraes in 
Stripe 82 as function of Galactocentric distance r. Bottom: The cumulative 
number of RR Lyrae within radius r in Stripe 82, together with the fits given 
in eqn 1121 . 



Figure 14. The number density of RR Lyraes in the plane of Galactic longi- 
tude versus r magnitude. There are three obvious overdensities correspond- 
ing to the Sagittarius Stream, the Hercules-Aquila Cloud and the Pisces 
Overdensity. 




-601 

-40 -30 -20 -10 □ 10 20 -40 -30 -20 -10 10 20 10 20 30 40 50 60 
X [kpc] X [kpc] Y [kpc] 



Figure 15. The density distributions of RR Lyrae candidates projected onto 
the principal planes of Galactocentric (x,y, z) coordinates. Here, red rep- 
resents the high metallicity RR Lyrae population ([Fe/H] > —1.3), green 
medium (-1.5 < [Fe/H] < -1.3) and blue low ([Fe/H] < -1.5). The insets 
show each distribution plotted separately, together with a white circle cen- 
tered at the origin. There is no obvious distinction according to metallicity, 
and the low, medium and high metallicity RR Lyraes are clearly distended 
and seemingly belong to the Hercules-Aquila Cloud. In particular, none of 
the populations is distributed in a spherically symmetric manner around the 
Galactic Centre. 
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stellar halo at r ~50kpc has been proposed before bv llvezic et al.l 
(2000), using a sample of 148 RR Lyraes in SDSS commission- 
ing data. However, the same authors later applied their method 
to a la rger area of the sky and found no break until at least 
70kpc dlvezic et al]|2004l) . Vivas et al. (2006) also found no break 
before the limit of their survey at ~60kpc. So, "edge" may be too 
strong a term, but the number density profile of the RR Lyraes does 
seem to be best matched by a broken power-law, as shown in the 
lower panel of Figure [J_3] Such a parameterisation was first advo- 
cated by Saha (1985), who noticed that the RR Lyrae density fell off 
much more rapidly beyond Galactocentric radii of 25 kpc. Adjust- 
ing by the fraction of Galactic volume sampled by our survey, and 
assuming that our efficiency is e w 1 , we find that the spherically- 
averaged number density of RR Lyrae as 



n(r) = 2.6 X 



( 



23kP£V" if5<r<23kpc 



j23kpcj- if2 3 <r< iQOkpc 



(12) 



out to ~ 100 kpc, beyond which our data is highly incomplete, with 
only bright RR Lyraes detectable (see Figure [TBI. Our break radius 
of 23 kpc is very close to that proposed by Saha (1985). 

The inner power-law slope is almost the same as that found 
by Miceli et al (2008) - namely n=-2.43 - in a very large sample 
of RR Lyrae stars closer than 30 kpc in the LONEOS survey. Of 
course, formulae such as eqn d!2t are just a parameterisation of the 
data, as the RR Lyrae distribution is neither spherically symmetric 
nor smooth, but dominated by the three structures in the Stripe. The 
break at r~25 kpc is really a consequence of the fact that most of 
the RR Lyraes are in the Hercules- Aquila Cloud and the Sagittar- 
ius Stream substructures, which lie within 40 kpc of the Galactic 
centre. A similar c onclusion regardin g the importance of substruc- 
ture is reached by ISesar at all d2007h . who divide their RR Lyrae 
distribution into 13 clumps, of which they suggest at least seven 
correspond to real substructures. 

In Figure [14] the number density of RR Lyraes is plotted in 
the plane of Galactic longitude versus i band magnitude. The three 
substructures show up very clearly, together with some isolated hot 
pixels that may be indicators of real objects. We define the Sagit- 
tarius Stream RR Lyraes via 



180° > C > 135° 



16.5 < r < 18.5 



(13) 



RR Lyraes associated with the Hercules-Aquila Cloud are extracted 
via 



80° > I > 45°, 



14.5 < r < 20, 



t + I5r < 358 



(14) 



For the Pisces Overdensity, we chose the stars satisfying 

95°>f>60°, r>19, ^+15r> 358. (15) 

These cuts give 55 stars in the Sagittarius Stream, 28 in the Pisces 
Overdensity, and 237 in the Hercules-Aquila Cloud. 

Ideally, we would like to separate any contaminating Bulge 
and Spheroid RR Lyrae from those of the Hercules-Aquila Cloud, 
but this is not easy. In particular, Figure [15] shows the density dis- 
tribution of the RR Lyrae populations colour-coded according to 
metallicity. The comparatively metal-rich RR Lyraes (red in the 
Figure) do not seem to be distributed any differently from the com- 
paratively metal-poor (green and blue). In fact, all the distributions 
are distended and distributed asymmetrically relative to the Galac- 
tic Centre, consistent with the bulk of the stars belonging to the 
Hercules-Aquila Cloud. 

The properties of the RR Lyrae in the different substructures 



are listed in Table[2] Note that very nearly 60 per cent of all the RR 
Lyraes in Stripe 82 are associated wi th the Hercules-Aquila C loud, 
emphasising the arguments made bv lBelokurov et al. I ( 120070 as to 
the importance of this structure. The mean heliocentric distances of 
the Hercules-Aquila Cloud and the Sagittarius Stream are compa- 
rable in Stripe 82, but the Pisces Overdensity is much further away 
at D~80kpc. The Pisces Overdensity lies within a few degrees 
of the Magellanic Plane. Although the distance of the Overden- 
sity is greater than that of the Large and Small Magellanic Clouds 
(D~55 kpc), it is possible that the Magellanic Stream may be more 
distant in this area of the sky. Thus, at present, it is unclear whether 
the Pisces Overdensity is related to Magellanic Cloud debris. 

One way to estimate the mass of the Pisces Overdensity is 
to compare with the Carin a dwarf sph eroidal, which is at a sim- 
ilar distance (~100 kpc; lMateolll998h . Carina has a total mass 
of ~ 2 x 10 7 M o and a mass-to-light ratio of ~70. At leas t 75 
RR Lyrae stars have been detected by iDall'Ora et alj d20030 us- 
ing well-sampled multi-epoch data in the B and V bands, although 
over a small baseline of a few days. Assuming similar stellar pop- 
ulations and similar efficiencies of detection of bright RR Lyrae 
in the surveys, then we can use a simple scaling argument to esti- 
mate the total stellar mass associated with the Pisces Overdensity 
as ~ 10 5 M o . We can corroborate this mass estimate by comparison 
with our data on the Hercules-Aquila Cloud. The calculation using 
the Hercules-Aquila Cloud has the advantage that the RR Lyrae 
populations in the two structures have been discovered by the same 
algorithm, but the disadvantage that the properties of the Cloud are 
al so rather uncertain. The absolute magnitude of the Cloud is given 
bv lBelokurov et aL as M, =-13, suggesting that its total stel- 

lar mass is ~ 10 7 M o . Of course, the Cloud is an enormous structure, 
covering ~80° in longitude and probably extending above and be- 
low the Galactic plane by 50°. Only a small fraction (~ 1 per cent) 
of the Cloud is probed by the Stripe 82 data, suggesting that there 
must be ~ 2x 10 4 RR Lyraes associated with the Cloud in total. The 
mass of the Cloud covered by the Stripe is ~ 10 5 M o . Again, assum- 
ing similar stellar populations, the mass associated with the Pisces 
Overdensity is at least ~ 10 4 M G , a value encouragingly similar to 
our first estimate. 

Of the 28 stars identified as members of the Pisces Overden- 
sity, the intrinsically faintest has M z = 0.76. The stars extend over 
an area of 55 deg 2 of Stripe 82. Thus, the surface number density 
of the RR Lyrae is 0.5 1 deg~ 2 . By comparison, the Hercules-Aquila 
Cloud has 209 RR Lyraes with an absolute magnitude brighter than 
M ; =0.76. They cover 95 deg 2 of Stripe 82, and hence the surface 
number density is 2.20 deg 2 . This suggests that the Pisces Overden- 
sity is ~4.33 times more diffuse than the Hercules-Aquila Cloud. 

The distance and metallicity distributions of our RR Lyraes 
can be used to study the properties of the Hercules-Aquila Cloud, 
the Sagittarius Stream and the Pisces Overdensity. Plots are shown 
in Figure Q~6] The distribution of heliocentric distances for the 
Cloud has a mean of 22.0 kpc and a standard deviation of 12.1 kpc. 
One possible interpretation of the Cloud is that it is analogous to 
caustic features like the shells seen around elliptical galaxies. How- 
ever, the considerable depth of the Cloud seen in the RR Lyrae dis- 
tribution tends to argue against such an interpretation as a caustic 
structure. 

In directions towards Stripe 82, the distances of the arms of 
the Sagittarius Stream are not well-known. Simulations offer a 
rough guide, but no m ore than that. The upper panel of Figure 3 of 
Fellh auer et all (2006) shows the young leading arm (A), together 
with parts of the old trailing arm (B), at distances of 15-20 kpc; fur- 
ther, material belonging to parts of B, the old leading arm (C) and 
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Figure 16. The distributions of distances (top) and metallicities (bottom) of RR Lyraes in the Hercules-Aquila Cloud (left) Sagittarius Stream (centre) and the 
Pisces Overdensity (right). 



the young trailing arm (D) spread out over a swathe of distances 30- 
60 kpc. The distribution of distances of our Sagittarius RR Lyraes 
in Figure [16] does indeed show some evidence of bimodality. It 
is possible that the peak at distances £>~20kpc corresponds to the 
A and B streams, whilst the peak at £)~35 kpc corresponds to the 
other wraps. However, the simulations also suggest that the second 
peak should be much broader than appears to be the case in the data. 
Our identifications are tentative and radial velocity data is required 
to enable a cleaner separation of the wraps, as is evident from the 
lower panel of Figure 3 of Fellhauer et al. (2006). 

From Table [2] we see that the RR Lyrae in the Hercules- 
Aquila Cloud have a metallicity [Fe/H]=-1.42±0.24, whilst those 
associated with the Sagittarius Stream have [Fe/H]= -1.41+0.19. 
The Pisces Overdensity has a metallicity [Fe/H]= -1.47+0.34, com- 
parable to the Hercules-Aquila Cloud, but richer than the typical 
p opulations in the out er halo, which have a metallicity of [Fe/H] — 
2 dCarollo et ai]|2007h. 

In fact. I Vivas etZI J2005h have already carried out VLT spec- 
troscopy of 14 RR Lyrae variables that lie in the leading arm of 
the Sagittarius Stream, finding a metallicity of [Fe/H]=-1.76±0.22. 
The stars lie well away from Stripe 82 at right ascensions 13 h < 
a < 16 n and at heliocentric distances of ~50kpc. Many of the RR 
Lyraes in our sample will belong to the trailing arm, which may 
account for some of the difference in the metallicity estimate. 

We show a view of Stripe 82 as derived from SDSS main- 
sequence turn-off (MSTO) stars in Figure[T7] The upper panel gives 
the number of MSTO stars as a function of right ascension and 
distance. The one-dimensional histogram plotted in black shows 
the dependence of number on right ascension alone. Note that the 
Sagittarius Stream is immediately visible at a ss 40°. There are 
two density maxima in the black histogram, perhaps hinting that 
more than one wrap of the Stream is detectable in MSTO stars. The 
Hercules-Aquila Cloud substructure also shows up very clearly, al- 
though the fainter Pisces Overdensity is understandably absent. The 
distance estimates to the substructures derived from MSTO stars 
agree well with those from RR Lyraes. The SDSS DR6 includes a 
large number of stellar spectra which have been analysed to pro- 
vide r adial velocities and fundamental stellar atmospheric param- 
eters dLee et al]|2008l) . The velocities of all stars with spectra and 



satisfying g - i < 1, to remove most of the thin disk contaminants, 
are plotted against right ascension along Stripe 82 in the middle 
panel. Of course, most of the stellar targets are disk stars, and so 
the curve v G sr =190 cos b sin I kms~' is plotted to show the locus 
of the thick disk in this dataset. The Sagittarius Stream stars are 
clearly offset in velocity from the thin disk at vgsr ~ -130 kms~'. 
The bottom panel shows the same data, but now colour-coded ac- 
cording to metallicity so as to highlight different structures. We can 
detect the kinematically bifurcated Sagittarius Stream and clearly 
see the separation of the more metal-rich Galactic disk and bulge 
stars from the Hercules-Aquila Cloud. The eye can also discern 
some fainter substructure, the reality of which remains to be estab- 
lished. 

A final view of Stripe 82 is provided in Figure [T8j which 
shows the density distribution of blue horizontal branch stars 
(BHB s) and blue stra gglers (BSs), selected using the colour cuts 
of lYannv et ail feoOO). The BHB population of course abut the RR 
Lyrae population in the Hertzsprung-Russel diagram. We might ex- 
pect to see all three substructures in the BHB density - and so it is 
reassuring that the Sagittarius stream, the Hercules-Aquila cloud 
and the Pisces Overdensity are all visible. The same substructures 
are also identifiable in the BS populations with the exception of the 
Pisces Overdensity which is of course too distant. The Sagittarius 
Stream is clearly bifurcated in BHBs, although not in BSs, suggest- 
ing that the ratio of BHBs to BSs varies along the Stream. There is 
some evidence for bimodality in the BHB distance distribution of 
the Hercules-Aquila Cloud. 



5 CONCLUSIONS 

We have constructed a catalogue of 21 939 variable objects in Stripe 
82. The catalogue of variables is published in full as an electronic 
supplement to this article. We have extracted a sample of RR Lyrae 
stars, 316 RRab types and RRc types, from the variable catalogue, 
using a combination of cuts based on colour, period, amplitude 
and metallicity. The RR Lyraes lie at distances 5-1 15 kpc from the 
Galactic centre and individual distance estimates, accurate to typi- 
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Figure 17. Top: The distributions of main sequence turn-off stars in Stripe 
82 shown in the plane of right ascension versus heliocentric distance. The 
black histogram shows the one-dimensional distribution as a function of 
right ascension only. Middle: The velocity distribution of all stars with 
g - i < 1 with SDSS spectra. The white dashed curve shows the line 
190 sin C cos b kms~' and marks the expected locus of stars belonging to 
the thick disk. Most thin disk stars are excised by the colour cut. Bottom: 
As middle, but the distribution is now colour-coded according to metallicity 
with red representing [Fe/H] > - 1 , green - 1 . 67 < [Fe/H] < - 1 and blue low 
[Fe/H] < -1.67. This separates the thin disk stars (reddish) from the older 
and metal-poor components of the stellar halo, such as the Hercules-Aquila 
Cloud. 



cally 8 per cent, are calculated using the colour, period and metal- 
licity to estimate absolute magnitude. 

If the RR Lyrae data are modelled by a smooth density dis- 
tribution, then a good fit is provided by a broken power-law. The 
number density of RR Lyrae falls with Galactocentric radius r like 
n(r) ~ r~ 1A for 5 < r < 23 kiloparsecs, switching to a much steeper 
decline, nij) ~ r~ 4 - 5 for 23 < r < 100 kiloparsecs. However, 
smooth, spherically-averaged density laws do not tell the whole 
story, as in reality the RR Lyrae distribution is strongly clumped. 
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Figure 18. The distributions of BHB stars (left) and blue stragglers (right) in 
Stripe 82 shown in the plane of right ascension versus heliocentric distance. 



In Stripe 82, the distribution of RR Lyraes is dominated by three 
enormous substructures - namely, the Hercules-Aquila Cloud, the 
Sagittarius Stream and the Pisces Overdensity. We identified sam- 
ples of 237 RR Lyraes in the Hercules-Aquila Cloud, 55 stars in the 
Sagittarius Stream and 28 in the Pisces Overdensity. 

RR Lyraes belonging to the Hercules-Aquila Cloud are very 
numerous, and comprise almost 60 per cent of our entire Stripe 82 
sample. Although there may be some contamination from a smooth 
component of RR Lyraes associated with the Galactic Bulge and 
Spheroid, there is no doubt concerning t he existence of the struc - 
ture, supporting the initial identification o flBelokurov et al."ld2007t) . 
We estimate that the total number of RR Lyraes associated with the 
Cloud is 2 x 10 4 . The Hercules-Aquila RR Lyraes lie at distances 
from the Galactic Centre of 20.2± 11.3 kpc, and are metal-poor with 
[Fe/H] = -1.42 ±0.24. 

Both leading and trailing arms of the Sagittarius Stream also 
intersect Stripe 82. Simulations predict that the leading wrap is 
closer in heliocentric distance than the trailing, but the locations 
of the arms are not accurately known in this region of the sky. The 
heliocentric distances of our Sagittarius RR Lyraes, which predom- 
inantly are associated with the trailing arm, have a mean of 26.2 kpc 
and a dispersion of 5.5 kpc, whilst their metallicity is [Fe/H] = 
-1.41 ±0.19. 

We have also identified a new concentration - the Pisces Over- 
density - consisting of 28 RR Lyraes centered on Galactic coordi- 
nates of (£ x 80°, b x -55°). This is one of the most distant clumps 
so far found in the halo, as the RR Lyrae lie at distances of ~80 kpc. 
Although the location is close to the Magellanic Plane, the Pisces 
Overdensity is much more distant than the Magellanic Clouds and 
may well be unrelated to any known component of the Galaxy. We 
have made an order-of-magnitude estimate of the total mass asso- 
ciated with the Overdensity as at least ~ 10 4 M o . The associated 
RR Lyrae have a metallicity [Fe/H]=-1.47±0.34, comparable to the 
Hercules-Aquila Cloud, but richer than the typical populations in 
the outer halo. 

Our investigation has exploited the advantages of RR 
Lyrae stars for identifying remnants and substructure present 
in the halo of the Galaxy. Together with earl ier SDSS discov- 
eries ( iBelokurov et al. 1 120071 ; ljuric et al.l [2008), the patchy and 
clumpy nature of the RR Lyrae distribution adds support to the pic- 
ture of an outer halo composed of overdensities and voids, perhap s 
entirely devoid of any smooth component (e.g. Bell et aini2008h . 
Further study of the kinematics and metallicities of RR Lyraes in 
Stripe 82 should lead to a major advance in our understanding of 
the Galactic halo, albeit that significant observational resources will 
be required to acquire the necessary follow-up spectroscopy. 
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APPENDIX A: COMPARISON WITH OTHER 
VARIABILITY SURVEYS IN STRIPE 82 

llvezic etal.U2007l) constructed a catalogue of one-million standard 
stars with r magnitudes 14.0-22.0, by averaging repeated obser- 
vations of unreso lved sources in the 290 deg 2 area of Stripe 82. 
ISesar a t al. (2007) used the catalogue to carry out the first analysis 
of variability in Stripe 82. In particular, they applied cuts x' r > 3 
and;^ 2 > 3, followed by the requirement that the root-mean-square 
scatter exceeded 0.05 mag, to identify variability, obtaining a cata- 
logue of 20 533 variable sources. 

Bramich et al.'s (2008) light-motion curve catalogue (LMCC) 
is based on observations of Stripe 82 restricted to a smaller area of 
249 deg 2 , extending in right ascension from 20.7 h < a < 3.3 h with 
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Figure Al. Colour-colour plots for 4 648 variables brighter than g = 19.0 (left) and 21 789 variables brighter than g = 22.0 (right). The upper panels are g - r 
versus u - g, the lower panels are r - i versus g-r. Sesar et al. (2007) label these regions as white dwarfs (Region 1, red), low-redshift quasars (Region 2, 
orange), M dwarf/white dwarf binaries (Region 3, green), RR Lyraes (Region 4, cyan), main stellar locus (Region 5, blue) and high-redshift quasars (Region 
6, purple). 



g < 19 g < 20.5 g < 22 



Region 


Name 


% all 


% var 


% all 


% var 


% all 


% var 


1 (red) 


white dwarf s 


0.09 


0.43 


0.16 


0.33 


0.20 


0.36 


2 (orange) 


low-redshift quasars 


0.29 


21.32 


1.20 


52.96 


5.31 


61.05 


3 (green) 


dM/WD pairs 


5.95 


1.01 


9.51 


1.59 


11.77 


3.65 


4 (cyan) 


RR Lyrae stars 


3.27 


15.23 


3.62 


9.31 


3.47 


7.85 


5 (blue) 


stellar locus stars 


75.93 


56.41 


74.62 


31.44 


69.65 


22.48 


6 (purple) 


high-redshift quasars 


14.47 


5.59 


10.89 


4.37 


9.61 


4.62 




total count 


283,899 


4,648 


447,800 


12,788 


518,058 


21,789 



Table Al. The distribution of candidate variable sources in the g-r versus u — g diagram. The columns list the fraction of the whole sample and the variable 
subsample lying in the six regions of the colour-colour plot. 



a width 2°52 in declination from 6 = - 1°26 to 1°26. We extracted a 
sample of high-quality variable stars from the LMCC by imposing 
the restrictions that: i) xl > 3 and^ > 3, ii) a cut on the Stetson 
index L g > 1, iii) at least 10 good epochs are retained, giving a a 
catalogue of 21 939 variable objects. Applying Sesar et al's cuts to 
our catalogue gives 22 483 objects, with w 80% in common with 
our subsample based on Stetson index cuts. Even though Sesar et 
al.'s cuts give more candidates, the additional objects possess vari- 
ability in different passbands that is not well-correlated. 

Sesar and co-workers used a colour-colour plot to discrimi- 
nate between different classes of variable objects. In Figure lATI 
our stellar subsample is plott ed in g - r versu s u — g and r - i 
versus g - r. Here, following ISesar at alj d2007h . the e — r versus 
u — g plot is divided into six regions, and labelled according to pos- 
sible occupants: white dwarfs (the red-coloured Region 1), low- 



redshift quasars (the orange-coloured Region 2), M dwarf/white 
dwarf binaries (the green-coloured Region 3), RR Lyraes (the cyan- 
coloured Region 4), stellar locus stars (the blue-coloured Region 
5) and high-redshift quasars (the purple-coloured Region 6). The 
colour-space divisions provide only very rough classifications. In 
some cases (such as region 1), the label does not even describe the 
typical population, and we merely use the labels as a point of com- 
parison to Sesar's work. 

The percentages of the variable subsample and the whole sam- 
ple lying in the regions of the colour-colour plots are given in 
Table lATI whilst sample lightcurves have already been shown in 
Figure [3] Almost all (>93 per cent) of the variable objects lie in 
three regions - namely, low-redshift quasars (53 per cent of the 
catalogue), stellar locus stars (3 1.4 per cent), and RR Lyrae stars 
(9.3 per cent). When split according to magnitude, the bright (g< 
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Figure A2. The spatial distribution of the variable subsample in Stripe 82. Objects are colour-coded according to the Regions of the colour-colour plot in 
which they lie (see Figure IaTV The upper panel shows the number of all the variable objects versus right ascension. 

19.0) variable-sky is dominated by stellar locus stars, but the faint 
(g<22.0) variable-sky is dominated by qu asars. We can compare 
our results to Table 1 of (Sesar at al. 2007), which shows the same 
quantities for their variable subsample. Our variability criteria picks 
out more variable objects, and in particular more denizens of the 
main stellar locus. 

The spatial distribution of variable objects in Stripe 82 is 
shown in Figure IA2I The equatorial stripe reaches down to low 
Galactic latitudes beyond a s= 18 h (see e.g. Figure 1 of Belokurov 
et al. 2007). Variables belonging to Region 4 (RR Lyraes) and Re- 
gion 5 (the main stellar locus) dominate here, whereas variables 
belonging to the other Regions are more uniformly dispersed in 
right ascension. 
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